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Abstract — A social network grows over a period of time with 
the formation of new connections and relations. In recent years 
we have witnessed a massive growth of online social networks 
like Facebook, Twitter etc. So it has become a problem of 
extreme importance to know the destiny of these networks. Thus 
predicting the evolution of a social network is a question of 
extreme importance. A good model for evolution of a social 
network can help in understanding the properties responsible 
for the changes occurring in a network structure. In this paper 
we propose such a model for evolution of social networks. We 
model the social network as an undirected graph where nodes 
represent people and edges represent the friendship between them. 
We define the evolution process as a set of rules which resembles 
very closely to how a social network grows in real life. We simulate 
the evolution process and show, how starting from an initial 
network, a network evolves using this model. We also discuss how 
our model can be used to model various complex social networks 
other than online social networks like political networks, various 
organizations etc.. 
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1. Introduction 

The analysis of online Social Networks has allowed 
us to answer many questions regarding the characteristics 
of network and how the network changes. Modeling the 
evolution of a social network and predicting the structure of 
the future network is a complex problem. Social networks 
grow and change quickly over time with the addition of new 
edges, signifying the appearance of new interactions/relations 
in the underlying social structure. In this work, we consider 
a social network as a network comprising of members which 
are connected in the network by the “friendship” relation. 
We try to understand the mechanisms by which the social 
network evolves over time and using this information we 
design a model which allows us to predict the structure of the 
future network. 

In order to model the evolution of a Social Network we 
have to know the network characteristics, as in many evolution 
studies, the underlying process for network change is assumed 
to be centered at the behavioral characteristics of the network 
members na. In a social network, network members tend 
to ’choose’ their friends by comparing relevant individual 
characteristics of the others with their own. A fundamental 
finding in many choice networks is that social members with 
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similar characteristics are more often connected with one 
another than with more dissimilar ones. This is known as the 
’’similarity effect” in social networks El. So we can infer that 
for any relationship between the network members of a social 
network there must be some common characteristics between 
those members which lead to that relation. This basic idea 
can be used to design the evolution model. In our work we 
call these common characteristics as factors. Eor example in 
online social networks like Eacebook we can observe certain 
factors namely frequency with which friends tag each other in 
their posts, place where they live, work place (school, office, 
university etc.), common interests (movies, songs, books) etc. 
These factors signify how similar the members of the social 
network are. Considering more factors we can come up with 
a model that can provide a more precise measure of the 
similarity amongst the individuals. It has also been shown in 
m that gender plays an important role in deciding the level of 
friendship in people thus gender can also be considered as a 
factor. The strength of the friendship depends on the number 
of common factors as well as the weight of these common 
factors. It means that we can associate the term ’’quality” with 
these relationships which shows how strong is the relationship 
among the members. 

In the following section we will discuss some prior work 
regarding evolution models. Subsequently, we will describe 
the model of evolution proposed by us. 

A. Related Work 

Several researchers have turned their attention to the evolu¬ 
tion of social networks at a global scale. Eor example networks 
become denser over time, in the sense that the number of edges 
grows super-linearly with number of nodes ||7|. In this paper 
they reported that the network diameter often shrinks over 
time, in contrast to the conventional concept that such distance 
measures should increase slowly as a function of the number of 
nodes. Some efforts has also been made in the direction to find 
the properties responsible for the network evolution. A variety 
of network formation strategies were investigated showing that 
edge locality plays a critical role in network evolution O. 
Many models have been designed to predict the links in social 
networks for example in jJI they introduced the notion of 
graph evolution rules in which they developed Graph Evolution 
Rule Miner(GERM) software to extract the rules responsible 
for network evolution and applied these rules to predict the 
future network. In the direction of basic principles responsible 
for social network evolution researchers have shown that the 


most important characteristic of social evolution is that the 
outcome of evolution process is not the result of central 
authority but are the consequences of the simultaneous choices 
of persons 1^ ifT^ ifTSll . Social actors try to realize their own 
goals by choosing between behavioral alternatives that are 
available to them under certain restriction CD. In ca they 
have reviewed, classified and compared different models of 
social networks. They have classified these models into two 
main categories, first in which the addition of new links 
depends on the local network ^imcimt{Network Evolution 
Models), and second in which probability of each link existing 
depends only on nodal attributes (Nodal Attribute Models). 
In their work, they have shown that Nodal Attribute Models 
produce very clear community structure. In next section we 
will discuss about our work and will describe our evolution 
model. 

II. Our Contribution 

In this section we discuss about the evolution model that 
we have proposed. The basic idea that we have used to 
design this evolution model is the transitive property among 
the relationships i.e if a person A is connected to a person 
B who in turn is connected to a person C, then there is 
high possibility that person A and C will be connected. We 
have designed the evolution model which uses this transitive 
property to evolve the network. In our model we represent 
the network as undirected graph such that nodes represent 
people and edges represent connections between them. These 
connections can be of any form of friendship relation like 
’’friends”, ’’acquaintance”, ”co-workers” etc. 

A. Preliminaries 

We have modeled the social network as a graph G = (V^E) 
where the nodes(y) represent the people present in the network 
and an edge e = (u^v) e E represents the friendship relation 
between them. As we have already discussed that for any 
social relation there must be some common characteristics 
(factors) which decides these relations. For this purpose we 
have associated factors with each edge. For simplicity, in 
our model we have assumed that the number of factors is 
finite. With each edge we associate certain factors which 
represent the common attributes between the nodes which 
are connected by that edge. Let F be a finite set of factors 
s.t F C N and Ej C F be the factors of an edge Cj for 
j = l,2,3,...,m where m is the total number of edges. To 
decide the ’’quality” of friendship we have associated score 
values with these factors. These values will decide at what 
degree the factors between the people are similar and the 
cumulative score value will be the measure of ’’quality” of 
friendship. For each factor in set Ej a score value s is given 
to it. For example if /i,/2,/s,/4, ••••,//c is a set of factors 
for an edge Cj then si, 52 , S 3 , 54 ,...., 5 /^ are the score values 
associated with these factors. The cumulative score value for 
each set of factors Ej on edge Cj is denoted by Sj which is 
equal to the Now we have defined the structure of 

our graph G = (F, F) in which V is the set of nodes and 
E is the set of edges and each edge Cj is associated with a 
set of factors Ej and a cumulative score value Sj for j = 
0,l,2,3,....,m where m is total number of edges in graph G. 
In next section we will describe the evolution process for our 
model. 


B. Process of Evolution 

Let at time i = 0, initial graph be Go and set of score 
values be scores which is the union of the all the scores of 
the edges present in Gq. At every time step we will try to 
add edges between the nodes according to the evolution rules. 
Suppose at time stamp i, Gi = (V^Ei) is the graph evolved 
from the initial graph then for all i > 0 , Gi = (V^Ei) is 
subgraph of G^+i = (F, F^+i) i.e. Ei C F^+i. According to 
our rules of evolution F^+i is obtained from E^. The set of 
scores of edges of Gi is denoted by Si = {suv\{u,v) G Ei}. 
In this model, once an edge is added to the graph during the 
evolution process, it is never deleted in any future time-stamp. 
Also the number of nodes in the graph is fixed. We assume 
that the set of factors Fg for an edge e will never change 
during the evolution process, i.e if Fg is the set of factors for 
an edge e at time step i then at any time step k > i,m Gk, E^ 
will remain to be the set of factors for the edge e. We have 
also considered a model in which we are allowed to add some 
nodes in the graph, but once added it will never be removed. 
In our next section we will describe the evolution rules on the 
basis of which the evolution of the network is done. 



Fig. 1. An example with six nodes 


C. Rules of Evolution 

In this section we present the rules of evolution of 
(Gi^scorci) which lead to some interesting properties of the 
social network graph. Starting from an initial social network 
these rules help in predicting the state of the social network 
after several steps. Our main assumption in the evolution 
process is that to add a new edge (u^v) say at the step 
there has to be another vertex w such that (u^w) and (w^v) 
are already existing edges in the graph. This modelling is 
based on a general real life observation that in order for 
two strangers to become friends there is usually a common 
friend. We also ensure that this edge is added if the cumulative 
score is more than a particular threshold t. We define this 
cumulative score by first taking the intersection of the sets 
E(^u,w) F(^ and then computing the maximum of 

over all possible w eV. 

Notice that in order to define the cumulative score we have 
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considered the arithmetic mean of the values , and 
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I and scaled them by the cardinality of the set of 

(w,v) \ 

common factors. The intuitive reason for this score function 
is as follows: 

• The quantity represents the score of sl factor 

for a particular edge and thus by taking an arithmetic 
mean of the two quantities we get the average score 
of Si factor that is common in both F(^u,w) ^^d 

• Multiplying the arithmetic mean by |F(n,w) n^(w,v)| 

scales this weight by the number of factors common 
to both and 

Notice that instead of taking arithmetic mean of the two 
quantities we can also take the geometric mean or harmonic 
mean to get an average score. Also despite the fact we can 
recommend an edge that has a cumulative score of an more 
than the threshold there is still a chance that two people might 
decide to not be friends because of some random uncorrelated 
event. Thus in order to take care of this issue we make 
our evolution rules randomized i.e. even though an edge has 
a cumulative score more than the threshold it is added in 
the evolution process with probability p and rejected with 
probability 1 — p. In the following we formally describe the 
rules of evolution of the network. 


Algorithm 1 Evolution Process 

i i — 0 

while no more edges can be added do 

for each (u^v) is not added, and V w such that 

{u, re), (re, v) G Ei find, 

2 ; = argmax(| • where k = Fuw f] 

u) Z t^uw tCwv 

if (I ■ (fcr + > * th®" 

add {u^v) with probability p and discard with prob¬ 
ability 1 — p. 

i ^ i 1 


In figure we can see the example of evolution. In this 
figure the network is made up of six nodes with starting 
connections marked as yellow, and after evolving this network 
we get the connections which are marked in red. For example 
V 2 is connected to V3 with the set of factors Fv 2 ,Vs as {2,3,4} 
and score value sv 2 ,V 3 as 9 and V 3 is connected to Ve with the 
set of factors Fv^y^ as {1,2,3,4} and score value as 10. 

So after applying the score function we get the intersection of 
factors as {2,3,4} and score value as 7. Since this calculated 
score value is greater than the threshold value (which is 6 ), 
this new link V 2 to Vq is added as a new connection. In our 
evolution model we add the edge with some probability. This 
process will repeat until there are no new edges that can be 
added. 

III. Experiments and Result 

To implement our evolution model we have considered 
some assumptions. The input to the evolution process is a 
graph which we called as ’’initial graph”, so to produce these 
initial graphs we designed a program which produces random 
graph by putting random edges between the nodes. So each 


time we require a graph we run this program to get an initial 
graph which will act as input to our evolution model. As we 
have described that the set of factors which are responsible 
for addition of edges can vary from person to person, we have 
factors on each edge randomly. So on each edge Cj 
we have assigned a set of factors Fj randomly from the set of 
totSil factors F. To decide the quality of ’’Friendship” we have 
assigned score values on each edge randomly as well. 



Fig. 2. Evolution process on 400 nodes using arithmetic mean 

Consider an experiment in which the initial graph has 400 
nodes and randomly placed edges. On each edge Cj the set of 
factors Fj that has been assigned is taken randomly from the 
set of SiW factors F = {1,2,3,..., 8 }. The score value on each 
edge Cj has been also taken randomly from the set of all score 
values S = {1, 2,3,4,..., 16}. On this initial graph we apply 
our evolution process to get the evolved graph. To visualize 
the graph we have used Gephi tool. In figure we can see the 
evolved graph. 

A. Experiment with different mean 



Fig. 3. Evolution process on 400 nodes using geometric mean 











In the previous experiment we have calculated the cumula¬ 
tive score value by taking the arithmetic mean of the individual 
score values. The new connections are made by calculating the 
cumulative score values of the common factors of a transitive 
relation. We consider another experiment on 400 nodes again 
with a random initial graph, but instead of taking the arithmetic 
mean of the score values we take their geometric mean to 
compute the cumulative score values. In figure the evolution 
does not show the formation of communities as was established 
by the previous experiment. 

B. Experiments with different number of nodes 

In this section we will show the experiment results for 
different number of nodes. 



Fig. 4. Evolution process on 500 nodes 



C. An iterative evolution process 

We have proposed another extension of our model in which 
rather than applying the evolution process to fixed number of 
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Fig. 6. Evolution process on 700 nodes 


Algorithm 2 Iterative Evolution Process 
while no more new nodes are added do 

i i — 0 

while no more edges can be added to Gi do 

for each {u^v) is not added, and V w such that 

ic), (ic, v) G Ei find, 

2 ; = arginax(^ • + ^)) where k = 

^ ^ t^uw K>WV 

(I • (fc + fc))- > t then 
add {u^v) with probability p and discard with 
probability 1 — p. 

i 4- i + 1 

add random set of nodes Vnew and a random set of edges 
connecting to Gi. 


nodes we are iteratively adding more nodes to the graph and 
applying the evolution process. This process will not terminate 
as nodes will be added at every stage of evolution so we have 
taken the snapshot of graph at time T. In this process random 
number of nodes from a new set of nodes Vnew are added 
at every time step T. These nodes are randomly connected to 
some nodes which are already present in the network and on 
this modified graph the evolution process is applied. So at some 
time step T we produce the graph which can be considered as 
a snapshot of the evolution process. The following algorithm 
shows the iterative evolution process. 

We have implemented the iterative evolution process on 
100 nodes. We can see in figure the graph on which the 
iterative evolution process has been applied. In this experiment 
we have taken the set of nodes Vnew as 20 from which random 
number of nodes are taken and are added to the graph at 
every time step, and the evolution process continues with the 
modified graph. Figure |7] shows the initial graph to which 
some random nodes are connected to it. Figure is the graph 
obtained after evolution when 10 nodes get added random 
to the graph in [ 7 ] at some time step. We can also observe 








formation of communities in iterative evolution process as 
shown by the networks evolved through the evolution process 
with fixed number of nodes. 



Fig. 7. Iterative Evolution process on 100 nodes 



Fig. 8. Evolved graph at time step T + k 


IV. Conclusion 

In this work we propose a model of evolution of social 
networks based on the transitive property of the growth of 
friendship relation. Our model is based on predicting the 
similarity of two people which satisfy the transitive property. 
We characterize a particular relation i.e. an edge in the social 
network graph, by a number of sociopsychological factors and 
a score value which measure the strength of the relationship. 
We observe that if we use a specific formulation to derive the 
strength of a recommended friendship using arithmetic mean, 
dense communities are formed which is a characteristic of real 
life social networks. Changing the nature of the formulation 
using other means doesn’t show formation communities. To the 
best of our knowledge such a model has not been proposed 
yet. We suspect that this model can be useful in understanding 
the evolution of complex social networks like online social 
networks, political networks, corporate networks etc. As part 
of future work, it would be interesting to see whether our 


model can make accurate predictions for real life social net¬ 
works in terms of formations of communities. It would also 
be interesting to prove theoretical guarantees on the results 
obtained in this paper which would in turn be an in depth 
study of the theoretical underpinnings of this work. 
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